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ABSTRACT 

We present a comprehensive data description for Ks-band measurements of Sgr A*. We characterize 
the statistical properties of the variability of Sgr A* in the near-infrared, which we find to be consistent 
with a single-state process forming a power-law distribution of the flux density. We discover a linear 
rms-flux relation for the flux-density range up to 12 mJy on a timescale of 24 minutes. This and 
the power-law flux density distribution implies a phenomenological, formally non-linear statistical 
variability model with which we can simulate the observed variability and extrapolate its behavior to 
higher flux levels and longer timescales. We present reasons why data with our cadence cannot be 
used to decide on the question whether the power spectral density of the underlying random process 
shows more structure at timescales between 25 min and 100 min compared to what is expected from 
a red noise random process. 

Subject headings: Galaxy: center — Techniques: photometric — Methods: statistical — Radiation 
mechanisms: non-thermal — Accretion, accretion disks — Black hole physics 



1. INTRODUCTION 

Sagittarius (Sgr A*) at the center of the our 
galaxy is a highly variable near-infrared (NIR) and 
X-ray source which is associated wit h a 4 x 10 6 M^ 

supermassiy e central bla ck hole (Eckart & G enzell 

199! [19971: Eckart et al .] EflOl Schodel et al. 2002; 
Eisenhauer et al.l 120031: IGhez et al.l [T998, 200QI l2005bL 
2008; Gilless en et al.l l2009f ) . While first detected as a 



bright, ultra compact, and comparatively steady radio 
source, the strong variability at shorter wave-lengths, 
the variable polarization of the NIR emission, and the 
correlation between fluctuations in the the sub-mm, 
NIR and X-ray regimes provide evidence that this vari- 
able emission originates in the direct surrounding of 
the black hole. Therefore, properties of the black 
hole and of the emission and accretion mechanisms in 
its close surrounding can be studied at these wave- 
lengths ([B aganoff et al. 2001; Por auet et aTll2003L 12001 
Genzel et al. 2003; Ghez et al. 2004, 2005a : Eckart et al. 
2004 l2QQ6aM d]bl. I2008a.b.c: Yusef-Zad eh et al.l l2006alE 
2007, 2008; Hornstein et al. 2007 Dodds-Eden et al. 
2001 ISltrnlTel^ 

To explain the observed variability and its correlation 
between the NIR and X-ray regimes several authors sug- 
gest Synchrotron Self-Compton (SSC) or inverse Comp- 
ton emission as the respo n sible radiation mechanisms 
(jEckart et al.l 12004 l2006allcl : lYuan et al.l 12004 iLiu et al.l 
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120061 : lEckart et all 120 121 ). Relativistic models that as- 
sume the variability to be linked to emission from single 
or multiple hot spots in the accretion disk near the last 
stable orbit of the black hole have been a pplied success- 
fully to individual datase ts (jMever et al.ll2006al E 120071 : 
IZamaninasab et aTl 120081 ). These models interpret the 
shorter timescales of the variability (between 10 and 30 
minutes) to be dominated by the timescales of the orbital 
motion 6 . Orbital motion close to the black hole (and an 
associated quasi periodic signal in the light curves) is of 
special interest: it could used as a timing experiment in 
a strong gravitational field that might allow for deter- 
mining black hole parameters like spin or inclination. 

Potential quasi-periodic oscillation s (QPOs) have bee n 
found in some light curves (see, e.g. JGenzel et al.l l2003). 
However, these QPOs are not statistically significant 
within the overall variability. Based on NIR light curves 
of 7 nights observed with Keck telescope iDo et aTl {2009) 
analyzed the flux density distribution of Sgr A* and the 
significance of quasi periodic oscillations. They showed 
that QPOs in total intensity light curves cannot be es- 
tablished with sufficient significance against random fluc- 
tuations, finding a pure red noise power spectral density 
sufficient to account for the time correlation of the fluc- 
tuations. 

O n the other hand , base d on relativistic mod- 
els, IZamaninasab et aTl ([20101 ) predicted a correlation 
between the modulations of the observed flux den- 
sity light curves and change s in polarime t ric da ta 
(also s ee lEckart et al.l 120063, lMgjiex^t"aTl l2006alE 
| 2007[ lEckart et al.l l2008ai ICunningham fc Bardeenl 
1973[ IStarkfc Conn olsJ 119771; Eb ramowic z et al.l 119911: 
Karas & Bao 1992; Hollywood et al. 1995; Dovciak et al. 
12004 120081 ). A comparison of predicted and observed 
light curve features (obtained from 6 nights of polarimet- 
ric observations with VLT and Subaru telescope) through 

6 rsj 20 minutes at the innermost stable orbit for Sgr A*. 
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a pattern recognition algorithm resulted in the detection 
of a signature possibly associated with orbiting matter 
under the influence of strong gravity. 

Since the first discove ry of variable emissi on of Sgr A* 
in the NIR in 2003 (jGenzel et all I2003D a number 
of publications concentrated on the statistical proper- 
ties of the flaring activity rather than on interpret- 
ing individual observations. These papers investigated 
the timing properties of the light curves as well as 
the r a diation mechanisms involv ed ([Yusef-Zadeh et al.l 
20091, [Za manina sab et all I2010L iBremer et al.l 120111 
Schodel et all 1201 U lEckart et all l2Q12h . On the base 
of 14 light curves observed between 2004 and 2008, in- 
cluding alte r natin g observations with VLT and Keck, 
iMever et al.l (j2009[ ) discovered first a dominant timescale 
at about 150 min, supporting linear scaling relations of 
break timescales in the power spectral density with the 
black hole mass. They determined the power-law slope 
of the high-frequency part of the power spectral density 
(PSD) to be 2.1 ±0.5. 

A comprehensive statistical approach in the anal- 
ysis of the Ks-band total intensity variability ob- 
served with NACO at th e VLT has been conducted by 
iDodds-Eden et al.l (|2Q11[ ). The authors analyzed VLT 
K-band data between 2004 and 2009. They presented a 
detailed investigation of the flux density statistics and 
described the time- variable stellar confusion at the posi- 
tion of Sgr A* that makes an investigation of the faint 
emission difficult. They also emphasized the importance 
of these faint states for the overall statistical evaluation 
of the variability of Sgr A*. Based on the flux density 
histogram the authors claim the evidence for two states 
of variability, a log-normal distributed quiescent state for 
flux densities < 5 mJy and a power-law distributed flar- 
ing state for flux densities > 5 mJy. They argue that 
it is very unlikely that the same variability process is 
responsible for both high and low flux density emission 
from Sgr A* (the statistical model the authors promote 
is sum marized in Append ix |A|) . With reference to this 
model iGenzel et al.l ((2010) state that "it is a key issue 
whether the brightest variable emission from Sgr A* are 
statistical fluctuations from the probability distribution 
at low flux or flare events with distinct properties" and 
that "the transition from the log-normal distribution at 
low-flux levels to the tail of high fluxes may also ex- 
plain the apparent mismatch between the detection vs. 
non- detection of quasi-periodic substructures in differ- 
ent near-infrared light curve studies" . 

Our statistical analysis presented in this paper serves 
the following goals: 

• to provide a more comprehensive, uniformly re- 
duced data set of Ks-band observations from 2003 
to early 2010; 

• to conduct a rigorous analysis of the observed flux 
density distribution; 

• to explain why a proper statistical analysis of the 
Ks-band light curves cannot repro duce the results 
found bv IDodds-Eden et al.l (|2011h : 

• to conduct a rigorous time series analysis on the 
base of a representative dataset; 



• to propose a comprehensive statistical model that, 
using standard methods for generating Fourier 
transform based surrogate data, describes all as- 
pects of the observed (total intensity) data and lets 
us simulate light curves with the observed time be- 
havior and flux density distribution; 

• and to investigate extreme variability events in the 
context of our statistics. 

2. DATA REDUCTION 

In the following, we describe the data and the reduc- 
tion methods we applied in order to obtain time-resolved 
photometric information on Sgr A*. Whereas large por- 
tio ns of the dataset a r e the same as used in the analysis 
bvlDodds-Ede n et al.l (|20lTI) . we have chosen different re- 
duction methods: Lucy- Richardson deconvolution in or- 
der to guarantee the best possible isolation of the target 
sources from nearby point-like sources, a well controlled 
flux density calibration with 13 stars, and an objective 
quality cut based upon seeing and Strehl ratio values. 

2.1. The dataset 

Our analysis is based on ESO archive data. All ob- 
servations have been conducted with the NIR adaptive 
optics (AO) instrument NAOS/C ONICA at the Very 
Large Telescope (V LT) in Chile (jLenzen et al.l 120031 : 
iRousset et al.l 120031 ). We included all available Ks-band 
frames of the central cluster of the Galactic Center (GC) 
from early 2003 to mid-2010. For all observations, the 
NIR wave front sensor of the NAOS adaptive optics sys- 
tem was used to lock on the NIR bright supergiant IRS 
7 (variable, raxs = 6.5 in the 1990s, mxs = 7.4 in 2006 
and m Ks = 7.7 in 2011, 5.6" north of Sgr A* 7 ). Two 
different cameras, S13 and S27, with 13" and 27" field 
of view 8 respectively, and a polarimetric mode with in- 
serted Wollaston prism and mask have been used. The 
small field of view of the polarimetric mode restricted 
the set of calibrators to the innermost arcsecond around 
Sgr A*. 

We concentrated on datasets with a length of more 
than 40 minutes (shorter datasets are often severely 
affected by bad weather conditions in which case the 
observer decided to change to another wavelength or 
target). Problematic frames with obviously bad AO- 
correction (most of the stars not visible at all) or frames, 
which do not show Sgr A* or a sufficient number of cali- 
brators, have not been included for the photometric anal- 
ysis. Ultimately we investigated 12855 frames photomet- 
rically. Table O shows a list of all datasets that are part 
of this analysis. 

2.2. Data reduction and flux density calibration 

We performed every reduction step for every frame uni- 
formly: The reduction included basic steps like sky sub- 
traction, flat fielding, and correction for bad pixels. For 
total intensity data, we used sky flat fields (where avail- 
able) , for polarimetric data a lamp flat field ([ Witzel et al.l 
|2011[ ). Most of the data was observed using a jitter rou- 
tine with random offsets to prevent systematic influences 

7 see lSchodel etall (l2010h . iQtTeTaLl (fl999h 

8 and accordingly with different pixel sampling of 13 mas/pix 
and 27 mas/pix 
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on the measurements by detector artifacts. These offsets 
need to be detected and corrected for to guarantee sta- 
ble aperture photometry at a constant position. This 
was achieved with a cross correlation algorith m for sub- 
pixel accuracy alignment (ESO Eclipse Jitter, iDevillardl 
1999). For each aligned frame, we determined an esti- 
mate of the point spread function (PS F) with the IDL 
routine Starfinder (jDiolaiti et al.ll2000l ) using separated 
stars like S30 or S65 in the 2" surrounding of Sgr A*. 
The PSF-fitting algorithm of Starfinder provided an es- 
timate of the extended background in each single frame 
and a list of detected stars with position and relative 
flux density. We decided not to use the values resulting 
from Starfinder photometry (for a detailed reasoning see 
below). Instead we used the Lucy- Richardson (LR) de- 
convolution algorithm to separate neighboring point-like 
sources. In the case of polarization data, we aligned all 
(four to six) polarization channels of the individual obser- 
vation night with the cross correlation algorithm. Data 
observed with the different pixel scale of the S27 camera 
(0.02715") were resampled to 0.01327" (S13). Finally a 
beam restoration was carried out with a Gaussian beam 
of a FWHM corresponding to the diffraction limited res- 
olution at 2.2/im (~ 60 mas). 




Fig. 1. — Ks-band image from 2004 September 3 0. The red cir- 
cles m ark the constant stars (see variability study in Rafelski et al. 
2007) which have been used as calibrators, the blue the position of 
photometric measurements of Sgr A*, comparison stars and com- 
pariso n apertures for background estimation. Source identifications 
from lGillessen et al.l (120091 ). 

After the described preparative steps we conducted 
aperture photometry at the p osition of 13 constant cal- 
ibrators (Raf elski et al.l 120071 ). of 6 comparison stars, at 
the position of Sgr A*, and at 8 positions where no star 
was detected (B- and C-apertures), to measure the back- 
ground flux (see Figure [I]). The background was esti- 
mated at the locations of lowest background (6 aper- 
tures) and close to Sgr A* (2 apertures) where no obvious 
point-like source is visible. We applied a circular masking 
of radius 0.04" at all the measurement positions. For a 
small number of observations, according to the available 
field of view, we accepted a smaller set of calibrators (at 
least seven). 

The positions of the apertures in each night have been 
defined as consistently as possible: For Sgr A* with the 



help of its brighter states, for the stars with the help 
of mosaics (averages over the single frames of one night 
in order to increase the signal to noise and to also esti- 
mate the centroid of the fainter comparison stars), care- 
fully following their proper motions. For the background 
apertures and the aperture of Sgr A* when it was faint 
we conducted triangulation relative to nearby stars. One 
set of positions then was used for all frames of the cor- 
responding night. For some polarimetric observations 
NACO was rotated. In these cases, we determined a ro- 
tation matrix for the position coordinates, making them 
comparable to the closest unrotated observations. 



TABLE 1 
List of calibrators 



star 


m Ks 


flux density 
[mJy] 


S26 


14.94 


6.79 


S27 


15.41 


4.41 


S6 


15.35 


4.66 


S7 


14.92 


6.92 


S8 


14.21 


13.31 


S35 


13.20 


33.74 


S10 


13.95 


16.91 


S65 


13.58 


23.78 


S30 


14.12 


14.46 


S98 


15.27 


5.01 


S100 


15.29 


4.92 


S84 


14.66 


8.79 


S107 


14.82 


7.59 



Note. — The flux density for each star was calculated correcting 
for extinction with m ex t = 2.46. 



For each aperture, we summed up its total content in 
analog-to-digital units (ADU). For the polarimetric data, 
the obtained ADU values of orthogonal channels were 
added. We subtracted the average background value (IB- 
apertures) in ADU from the calibrator values and con- 
ducted a flux density calibration u sing the photometri c 
values for the calibrators in Tab. [T1(Schod el et al.l l201Q). 
Because of the high proper motions of the stars within 
this field, the state of confusion of the calibrators changes 
from epoch to epoch. We applied the following algorithm 
to reduce the epoch-dependent systematic error of the 
calibration: 

First we calculated for each calibrator k the quantity: 



Ck 



fk 



ADU ADU 



10' 



0.4-m re f 



(i) 



with Ck the background subtracted ADU values for the 
k-th calibrator, and m re f its reference magnitude. We 
sorted these values, rejected the three largest and the 
three smallest values (for the data sets with less calibra- 
tors we accordingly reject a smaller number), and took 
the arithm etic averag e fp oy er the remaining values. Ac- 
cording to iTokunagal (|2000f ) we then obtained the mag- 
nitude ttia and flux density Fa for each aperture A by 



m A - 



-2.5 • log 



F A = 667- 10 3 • 10 



-0.4- (ra a -ra ex t) 



(2) 
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wi th m ex t = 2.4 6 the K -band extinction as determined 
in iSchodel et al.l (j2010h . This procedure ensures a best 
possible Constance of the flux density calibration under 
the changing conditions of each dataset. 

As a last step, we collected parameters for each frame 
that provide information on the data and calibration 
quality, which allowed us to reject data points based on 
objective criteria. These parameters are: Julian date, in- 
tegration time (NDITxDIT), rotator position angle (ori- 
entation of NACO), airmass, FWHM of the active optics 
guide star PSF, coherence time of the atmosphere, cam- 
era, all obtained from the header of the fits data, and the 
number of stars detected by Starfinder, the Strehl ratio 
calculated from the extracted PSF using the ESO Eclipse 
routine STREHL, the RMS of the values and, as the 
most important quality check, the average normalized 
flux of the calibrators. This last quantity is obtained for 
each frame by dividing the measured flux density of the 
individual by its reference value and averaging over all 
available (i.e. not rejected) calibrators. 

We emphasize that both methods - PSF-fitting and 
Lucy-Richardson aperture photometry - are in general 
equivalen t for estimating the flux density of a point 
source (jMever et al.l 12008). However, in the case of the 
presence of extended flux underlying a dim and confused 
point source observed under varying correction perfor- 
mance of the AO-system, it is more difficult to control 

• how Starfinder divides the given flux at a position 
into background and point source flux, 

• how the possible over-estimation of the flux due 
to noise-peaks influences the statistics of low flux 
densities (if the center of the fit is not forced to a 
given position), and 

• what non-detections due to the quality thresholds 
set in Starfinder mean for the overall statistics. 

Also LR-deconvolution has drawbacks, especially in 
handling extended flux that is added to existing point 
sources or gathered into artificial sources by the algo- 
rithm. We can account for this effect with a suitable 
big aperture and by monitoring apertures at positions 
without obvious point sources. Thus, the statistics of 
the interplay between the background (coming from un- 
resolved sources, truly extended emission and PSF con- 
tributions from the surrounding point sources), the AO 
and point-like flux at a given position is directly propa- 
gated to the statistics of the measured values, which is 
crucial for understanding the instrumental influence on 
our flux density statistics and our statistics do not suf- 
fer from non-detections that might introduce a selection 
effect. We will come back to the measurement statistics 
in section [3l 

2.3. Light curves of Sgr A* 

As a result of the reduction procedure described in sec- 
tion [221 we obtained the data shown in the upper panel 
of Fig. [21 F or convenience and follow ing the visualiza- 
tion used in iDodds-Eden et all (j201l[ ) , we show a con- 
catenated light curve with all time gaps longer than 30 
minutes reduced to the average sampling of the individ- 
ual datasets (1.2 min). This visualization shows the data 
of all nights as a pseudo-continuous light curve allowing 



for a comparison of the variability and the confusion in 
each epoch. A visualization of the true cadence is pre- 
sented in Fig. [131 The timing analysis in section [4] is 
based on the true cadence, and not on the concatenated 
light curve. 

These 12855 data points still include points of bad ob- 
servation conditions and insufficient calibration reliabil- 
ity. We used an objective frame rejection algorithm by 
incorporating information on seeing, Strehl ratio , frac- 
tion of stars detected by Starfinder 9 , the standard devia- 
tion of the Jo-values obtained from the individual calibra- 
tors in each frame , and the normalized average calibra- 
tor flux density as described in section [221 Only frames 
with seeing < 2" , a Strehl-ratio > 6%, a /o-standard 
deviation < 16% of the average Jo-value, and a normal- 
ized average calibrator flux density between > 0.96 and 
< 1.04 have been accepted. 

The top panel of Figure [21 additionally shows a long- 
time trend of the data from epoch to epoch. As far as 
these "of fsets" are concerned we a gree with the conclu- 
sions of IDodds-Eden et aTl (|2011h that confusion with 
stellar sources is responsible for this long-time trend in a 
first approximation. In order to make the different years 
comparable we subtracted the 2.5 percentile value of the 
flux in each epoch, resulting in mJy for 2003, 0.3 mJy 
for 2004, 1.0 mJy for 2005, 1.0 mJy for 2006, 1.3 mJy for 
2007, 2.8 mJy for 2008, 2.1 mJy for 2009, and 0.7 mJy 
for 2010. In 2003 and 2004 Sgr A* could be sufficiently 
separated from the near star S2 10 by the deconvolution 
reduction step. 

The second panel of Figure [21 shows the data after qual- 
ity cut and subtraction of the faint stellar contribution. 
For comparison and as an indicator of the calibration 
stability we show additionally the light curve of one of 
the calibrators, S7, in the third panel, and, in the low- 
est panel, the average calibrator flux density scaled to 15 
mJy. On average, the calibration is very stable and the 
data points after quality cut exhibit a relative standard 
deviations of the average calibrator flux of 1.4%. 

For a more detailed inspection we present 112 data 
blocks (defined by continuity without gaps of more than 
30 minutes) in Appendix O 

3. STATISTICAL ANALYSIS OF THE FLUX 
DENSITY DISTRIBUTION 

In this section we investigate the properties of the flux 
density statistics of the variability of Sgr A*. This anal- 
ysis gives information on both the intrinsic flux density 
distribution of the variability as well as the instrumental 
effects within our measurements. The differentiation be- 
tween instrumental features within the distribution and 
the intrinsic component turns out to be crucial in the 
context of the question whether the intrinsic flux density 
distribution provides evidence for two physical mecha- 
nisms at work. Additionally, this analysis allows us to 

9 This is a method to reject frames of lower quality with respect 
to the majority of the individual observation night. For each sub- 
frame of the night (i.e. the part of the jittered frames that is 
common to all) we count the stars detected by Starfinder. We then 
calculate the average number of detections over the night and the 
standard deviation and reject all frames with a negative deviation 
from the average of more than 1.7 times the standard deviation. 

u which is the reason why [Dpdds-Eden et al. ( 20TQ) did not 
include the 2003 data and which is clearly visible in the 2004 epoch 
of their uncorrected light curve 
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Fig. 2. — The concatenated light curve of Sgr A* and S7 with time gaps longer than 30 minutes reduced to 1.2 min. The top panel 
shows the result of aperture photometry before the quality cut, the next panel the same data after quality cut and removal of offsets. The 
third shows a light c urve of S 7, a nearby star that has been used as a calibrator. The time gaps are the result of the rejection procedure 
described in section 12.21 The lower panel shows the average ratio between the measured flux of each calibrator and its reference value 
(Tab. [TJ, scaled with a factor of 15. Its noise corresponds to the error a hypothetical noise- free aperture with a flux density of 15 mJy 
would have only due to the uncertainties of the calibrators. 

develop a full statistical model of the variability of Sgr A* 
in the next section. 

3.1. Optimal data visualization 

The first step in the analysis of the flux density distri- 
bution of Sgr A* is a proper graphical representation of 
the data in form of a histogram. A representation of our 
data in a simple flux density histogram, normalized by 
total number of points and bin size, is shown in Figure 02 

To investigate the high flux density tail of this distri- 
bution a logarithmic histogram with an equally spaced 
logarithmic binning is best suited. The number of bins 
for a given data range is a crucial parameter for the 
evaluation of the stru cture of the s ample distribution. 
Following the study of iKnuthl (|2006l ) we first determine 
the best bin size. As the author points out, the idea is 
to choose a number of bins sufficiently large to capture 
the major features in the data while ignoring fine details 
due to random sampling fluctuations. By considering the 
histogram as a piecewise-constant model of the probabil- 
ity density function from which n data points Xi were 
sampled the author derives an expression for the rela- 
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Fig. 3. — Flux density histogram of Sgr A*, based on the data 
shown in Figure [2] 



tive logarithmic posterior probability (RLP) for each bin 
number: 
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RLP = n log TV + 7Vlog T 



N 



•logrQ 

N 
A=l 



AT 

log T ( n + — 



(3) 



with TV the number of bins and nx the value of the Ath 
bin. To find the best number of bins M the posterior 
probability has to be maximized: 



M argmaxjRLP} 

N 



(4) 



The best estimator for the bin value fix and its variance 



g\ given the bin values nx is deduced to be: 



Ma 




and 



M 



(nA+j) (n-n A + M^i) 

L (-+¥ + 



My 

2 >/ 



(5) 



(6) 



with v the interval between the maximum and the mini- 
mum me a sure ment value. 

iKnuthl ([2006) demonstrates in his study that these 
results outperform several other rules for choosing bin 
sizes, e.g. "Scott's rule" or "Stone's rule". 

We applied the described binning method to our data 
of Sgr A*. To make th e flux density distri b ution compa- 
rable to the results by iDodds-Eden et al.l (|2011[ ). which 
include the flux density of the star S17 due to a double 
aperture method, we added 3 mJy to the flux density of 
Sgr A*. We find a best bin number of M = 32. The de- 
pendence of the log posterior on the bin number is shown 
in Figure |U The best piece- wise constant model (i.e., an 
estimator of the best histogram representation) describ- 
ing our sample is shown in Figure [5] The histograms in 
this work have been created using this method. 
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Fig. 4. — Optimal data based binning. We show the log posterior 
probability as a function of the number of bins. The maximum 
for the logarithmic flux density of Sgr A* is reached for 32 bins 
(blue line). The black lines represent bin numbers used for the 
histograms in Appendix [D] which clearly show that our results do 
not strongly depend on the binning. 
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Fig. 5. — Best piecewise-constant probability density model for 
the flux densities of Sgr A*. The red error bars are the uncertainty 
of the bin height for the full amount of 10 639 data points. The sec- 
ond bin height visible in some cases and the black error bars belong 
to the average h istograms of 1000 datasets with 6774 data points 
(Dodds-Eden et al. 2011), generated by randomly removing points 
from our full dataset. The over-plotted cyan and magenta dashed 
lines show the log-normal distr ibution and the power-law distribu- 
tion found in the analysis by Dodds-Eden et al. (2011), the blue 
the combined distribution of those components, convolved with a 
Gau ssian with a flux density-dependent a (compare Eqn. IA1I to 
lA4t . 

Additionally to the best histogram model obtained 
from Equation (j5j) and Equation (j6j) we over-plotted 
a grap h of the model 11 proposed by IDodds-Eden et all 
(2011) . It is obvious that our sample is more populated 
in the middle flux density range between 7 mJy and 15 
mJy and shows a linear behavior between 4 mJy and 17 
mJy, not showing any break or change of slope. That the 
shape of our histogram is not sensitive to the binning is 
shown in Appendix [Dj Figure [24j where we present his- 
tograms with 22 bins for the range of the observed flux 
density values (resulting in a comparable bin width as 
used bv IDodds-Eden et al.ll2Qllh and 45 bins 12 , both re- 
producing the linear trend. Rather than being a matter 
of representation, this difference is related to the different 
sample selection (10639 data points i n this work in com- 
paris on to 6774 points in the case of IDodds-Eden et al.l 
|2011[ ) . To be tter understand the chara cter of the selected 
subsample in IDodds-Eden et all (j2011l ) (their quality cut 
is based on the visual impression of the individual frame), 
we selected randomly 6774 points from our sample (by 
nonparametric bootstrapping, i.e. sampling with re- 
placement) and generated 1000 surrogate datasets in this 
way, binned each dataset in a histogram with the same 
bin size as for our total sample, averaged the bin val- 
ues over all surrogate sets, obtained an error from the 
standard deviation, and plotted the result as the second 
bin height (now with black error bars) in Figure [5] One 
can clearly see that for most of the bins there is barely 
any difference, showing that a random influence cannot 
be responsible for the difference of both distributions. 
This test shows the robustness of the linear behavior of 
the histogram also in the case of smaller datasets under 
random selection. Futhermore, in our dataset we do not 

11 slightly shifted on the x-axis to account for the proper off- 
set du e to the double aperture method used bvlDodds-Eden ^et al.l 
(2011), which before we included only roughly by adding 3 mJy, 
and to make it fit best the extremes of our histogram 

12 The latter we show in an unweighted and in an integration 
time-weighted version. 
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see the observation conditions to be correlated with the 
flux density states of Sgr A* (compare Figure [2] and Fig- 
ure [20] in Appendix IB]) . In general also data worse than 
average should be represented in the uncertainties, and 
not simply eliminated, because otherwise errors might be 
underestimated due to an introduced bias, and we have 
to conclude here that p robably the subsample used by 
iDodds-Eden et aTl (j2011f ) shows a severe selection effects. 

The linear behavior in the log-log diagram points to- 
wards a power-law distribution p ex x~ a as a possible 
description for all flux densities higher than ~ 4mJy. 
The statistical significance of this visual impression is 
analyzed in the next section. We mention here that a 
power-law distribution is only showing a linear behavior 
in a log-log diagram if it is of the form: 

p oc {x - x )~ a , x = . (7) 

Otherwise the logarithm log(p) of the probability density 
is only linear as a function of log(x) for large values of x: 



as a function of the mean flux density is shown in Fig- 
ure [71 For o ur photometry we clearly find smaller uncer- 
tainties than lDodds-Eden et al.l (|2011l ) for their aperture 
photometry, very similar to their S tarfm der photome- 
tr y, or the photometry of iDo et al.l j2009). In contrast 
to IDodds-Eden et aTl (|2Q11[ ) , the functional dependency 
shows a clear flattening towards small flux densities 13 
and we can not confirm the dominance of photon noise 
(|Dodds-Eden et all 1201 ![ ). We find a parabola to be a 
suited (phenomenological) description up to flux densi- 
ties of 32 mJy. Actually the uncertainties in the flux 
density range between and 10 mJy are more or less 
constant. This corresponds well to the visual impres- 
sion that the variable AO-correction and its influence on 
the background con tribution due to PSF halos of bright 
sources (halo noise, iFritz et all 120101 ) as well as differen- 
tial tip/tilt jitter and their interplay with the deconvo- 
lution are the dominant reasons for the uncertainties. 



log(p) = —a • \og(x — xo) + const (8) 
= —a • log(x) — a • log ^1 — — ^ + const . 

This is the main r eason why the distribution claimed in 
IDodds-Eden et aTl (|2011[ ) shows a break: In this case the 
high flux density tail is described by a power-law with 
xo ^0.8 mJy, and this power-law does not show a linear 
behavior in the log-log diagram if plotted versus the sum 
of intrinsic flux density, background and flux density of 
S2. It starts to deviate from a linear behavior (it, so to 
say, "breaks" ) close to the transition value F t of the total 
distribution (see Fig. [5j blue and magenta dashed line). 
Thus, even if we accept the selection of data points by 
IDodds-Eden et all (|2011[ ). the visual impression of the 
necessity of introducing a break in the distribution is a 
feature of the data visualization, and the power-law actu- 
ally is suited also in their case to describe the data down 
to a flux density value of about 4 mJy quite well. This 
m eans, even under the assu mption that the data selection 
of IDodds-Eden et~aTI (j2011l ) is valid, the discussion of a 
double state model for Sgr A* and its significance is very 
much depending on the evidence of a log-normal model 
describing the low flux density part of the histogram. We 
come back to this point in the next section. 

The reason why we see the distribution behaving lin- 
early inour visualization is a coincidental equality of 
— xo = 3 mJy in our power-law model (see next section) 
and the flux density of S17 (~ 3 mJy) which we added 
to the flux density of Sgr A* to make it comparable wi th 
the distribution proposed bv IDodds-Eden et al.l (j201l[ ). 

3.2. Power-law representation of the intrinsic flux 
density distribution 

We begin with a description of the instrumental effects 
and uncertainties of our photometry. Figure [6] shows 
the flux density histograms of 10 stars (calibrators and 
comparison stars). As expected, the width of the distri- 
bution is decreasing with decreasing flux density of the 
source. To estimate the FWHM of the scatter of our 
photometry at a given flux density, we fit Gaussian dis- 
tributions to the histograms. The a values of these fits 



2.5 rr 




flux density [mJy] 

Fig. 6. — The normalized flux density histograms of ten cali- 
bration stars. The dashed curves are Gaussian fits to the flanks 
of the distribution, suppressing the broader tails of the distribu- 
tions. This guarantees a proper measurement of the FWHM of the 
distributions. 
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13 This of course corresponds to a hyperbolic increase of the 
relative error toward small flux densities. 
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Fig. 7. — The measurement error as a function of flux density. 
The purple line is a quadratic fit to the measured cr-values of the 
calibration stars shown in Figure [6] (blue crosses) . Th e red line 
is the power-law dependency found by Do et al. (2009|) for their 
data, the green line and the black line the dependency found by 
Dodds-Eden et al. (2011) for their aperture photometry and their 
PSF-fitting photometry, respectively. The two magenta crosses 
represent the measured cr-values at the position of the C-apertures, 
compare Figure [T] 



The role of halo noise becomes evident when looking 
at the control apertures close to Sgr A* (Figure [TJ C- 
apertures). Their average flux density is clearly not zero, 
and the flux density values are scattered around the mean 
with a FWHM comparable to the width of all flux den- 
sity distributions of the stars fainter than 10 mJy. The 
aperture west of Sgr A* shows a varying contribution of 
faint confusion on the level of a few tenth of a mJy (Fig- 
ure [5J blue histogram) . The average flux density of these 
"empty" apertures is about 0.6 mJy. 
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Fig. 8. — The flux density histograms of the two background (C-) 
apertures (blue, brown) close to Sgr A* in comparison to the flux 
distribution of Sgr A* (magenta). The dashed curves are Gaussian 
fits to the flankes of the distributions. 



In the following we investigate if a power-law distribu- 
tion indeed is suited to describe our sample. We follow 
the strategy described in lClauset et al.l (j2007l ) for identi- 
fying power-law distributions and determining their pa- 
rameters. The authors of this study point out that a 
least square regression to a histogram in log- log represen- 
tation can generate significant systematic errors, mainly 
due to the non-Gaussianity of the variation of the log- 
arithmic bin height. Furthermore, binning the data in 
a histogram introduces further parameters corrupting 
standard goodness-of-fit estimators. The procedure de- 
scribed in the following overcomes this problems. 

The probability density of a power-law distribution is 
defined as: 



P(x) = \ a-l( x^xoy* . x>x 

^ 3?min y 3?min J 



% — ^min H~ %0 

f x 



(9) 



with x m i n = x m i n5 i n tr — %o and Xmin^intr the lowest value 
to which the data is power-law distributed, making the 
power-law normalizable with normalization factor (a — 
1) • ^in- The corresponding cumulative distribution is 
given by: 



P{x) 



1 

l-a 



X 

x 



(10) 



% — ^min 

/ _ \ 

l x xp \ ^> r 
I V x m i n / x mm 

The step s for analyzing powe r-law distributed data are 
as follows dClauset et al.ll2007D : 

• Find estimators for x m in and xq and the maximum 
likelihood estimator for a. The maximum likeli- 
hood estimator for a given any value for x m in and 
xq is calculated using the equations 



a es t = 1 + ntaii 



C"est 



E ln: 



Xq 



1=1 



^est 



V n tail 



(ii) 



(12) 



with x > x m i n + %o and n ta ii the number of data 
points higher than x m in + xq. Estimators for x m in 
and xq are obtained by choosing x m in and xq 
in a way that makes the probability density and 
the best-fit power-law model (i.e. the power-law 
with a the maximum likelihood estimator) as sim- 
ilar as possible. The similarity is estimated by 
Kolmogorov-Smirnov (KS) statistics: 



D — max \C(x) 

X>X m { n + Xo 



P(x)\ 



(13) 



with P{x) the cumulative distribution of the best- 
fit power- law model and C(x) the relative cumula- 
tive frequency of the empirical data sample. The 
parameter D has to be minimized. The error on 
x m i n and xq can be found by a nonparametric 
"bootstrap" method, i.e. (given n empirical data 
points) by drawing a new set of n data points uni- 
formly at random from the empirical data and de- 
termining the standard deviation of x m [ n and xq 
for these surrogate samples. To replicate the act 
of drawing an independent, identically distributed 
sample from the population the surrogate data has 
to be drawn with replacement. 

• Test for plausibility by calculating the goodness-of- 
fit between the empirical data and the power-law. 
The goodness-of-fit parameter q is defined as the 
fraction of synthetic data drawn from the best-fit 
probability model that has a worse KS statistics 
than the empirical data sample. Here it is impor- 
tant to obtain the KS value D for each synthetic 
sample in the same way as for the empirical data, 
i.e. the estimators for x m i n , xo and a have to be 
found for each synthetic sample individually and 
the KS values has to be calculated relative to the 
individual best-fit power-law. To ensure that the 
x m in values are determined under the same condi- 
tions like for the empirical data sample we have 
to make sure that the synthetic sample follows the 
empirical data sample distribution for the values 
smaller than x m in + #o- This is realized by choos- 
ing with probability 1 / ritai! a random number from 
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the best-fit empirical power-law and with proba- 
bility 1 — 1/ntaii a value from the empirical data 
below the x m i n + xq estimate (with n ta ii the num- 
ber of data points higher than x m i n + xq). A q- 
value > 0.05, or more conservative > 0.1, makes 
the power-law model a plausible assumption. 

With a simulation of 1000 surrogate samples (again ob- 
tained by nonparametric bootstrapping) for determining 
the errors of x m [ n and xo and 6000 synthetic samples for 
testing for plausibility we obtained the values: 

x = (-2.94 ±0.1) mJy 
x min = (4.22 ± 0.1) mJy 
a = (4.215 ±0.05) 

(7 = 0.2 . (14) 

A goodness-of-fit parameter q = 0.2 means that in 
a fifth of the cases a sample drawn from a power-law 
with parameters as in Equation will show deviations 
worse than our empirical sample, making the power-law 
description plausible for all flux densities higher than 
4.2 mJy (note that the exact value of x-axis offset is 
xo = —2.9 mJy, close to the value found due to the linear 
appearance of the histogram in the log- log diagram). A 
diagram of the cumulative distribution function (CDF) of 
the best-fits power-law and the empirical relative cumu- 
lative frequency is shown in Figure [9] . For a comparison 
we also over-plotted the CDF of the model proposed by 
iDodds-Eden et all ([20111 ) (restricted to flux density val- 
ues higher than x m { n ). Figure [TUl shows the value of the 
power-law slope a as a function of x m { n . As expected, 
the best x m { n is close to the point where a starts to be 
co nstant for a range o f £ m in values (compare Figure 3.3 
in IClauset et aTll2007[ ). 




flux density [mJy] 



Fig. 9. — Estimation of the goodness parameter p by a Kolmogrov 
statistic. The value p is defined by the maximum difference of the 
measured CDF (black) with respect to the CDF of the best fitting 
po wer-law (red). In green w e show the CDF of the model proposed 
by Dodds-Eden et al. ( 2011) restricted to flux density values higher 

than iCniin - 

Equation (fTTj) only represents the maximum likelihood 
estimator for the power-law slope if the Xi are indepen- 
dent or at least uncorrelated, otherwise the estimator is 
biased. However, we know that for our sample the x^i are 
not uncorrelated, since the flux densities are occurring 
in "flares", it means in a time-continuous development. 
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Fig. 10. — The scaling par ameter a as a functi on of the value for 
^min (compare Figure 3.3 in Clauset et al ,1 120071 V 



This describes simply the fact that finding the flux den- 
sity to be at a level of 15 mJy implies very low probability 
for a level of 5 mJy to be reached within e.g. the next 3 
minutes. This predictable behavior disappears on longer 
timescales, and it is not possible anymore, knowing the 
flux density at a time point to, to predict the flux den- 
sity level e.g. 100 minutes later very r eliably. This has 
been investigated bv lMever et al.l (j20Q9h in their analysis 
of the power spectral density discovering a timescale at 
about 150 minutes. Here we are analyzing data covering 
about 15 000 minutes, a hundred times longer than the 
timescale on which the correlation of the data vanishes. 
This means to first order and because the estimator in 
equation Equation ([TT]) is most sensitive to data points 
close to x m i n (where the histogram is most populated and 
the value of each bin can be considered to be fairly inde- 
pendent from the neighboring bins), the bias is negligible. 
For the high flux density tail the bias due to the time cor- 
relation is significant, because here the histogram bars 
are populated only with data points of the rare strong 
outbursts. In the case of our dataset all histogram bars 
higher than about 17 mJy are populated only due to one 
very bright outburst. This is the reason why for higher 
flux densities the empirical cumulative distribution de- 
viates more and more from the ideal CDF of the model 
and the statistics becomes incomplete. This effect is also 
visible in Figure [25] of Appendix [Dj where we show the 
ideal CDF, the empirical cumulative distribution and the 
cumulative distribution of some of the uncorrelated syn- 
thetic samples we generated for the estimation of the 
goodness parameter q. For high flux densities many syn- 
thetic datasets show a closer development with respect 
to their best-fit CDF than our empirical sample, even if 
their KS value D is worse. In this context it is impor- 
tant to notice that the synthetic samples with a worse 
KS statistics may differ significantly from the ideal CDF 
at lower flux density ranges which is not well visible in a 
log- log diagram (see Figure [25j Appendix ID]) . These ar- 
guments also show that a x 2 -minimization fitting to the 
histogram is questionable, especially if the x 2 -values are 
used to establish the significance of a distribution break 
based on the highest, most correlated bins. 

Up to this point we found a description for flux densi- 
ties higher than x m [ n . We can show that our data sample 
is consistent with a pure power-law describing the intrin- 
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sic flux density distribution under the influence of an in- 
strument with limited resolution and sensitivity and that 
x m [ n can be interpreted as the detection limit of NACO 
for Sgr A* due to being embedded in extended flux and 
its confusion by faint unresolved stars. The argument 
is simple: If we weight the power-law distribution for 
fluxes higher than x m i n , with a factor n ta i\/n with n ta ii 
the number of data points with values higher than x m i n , 
we can extend the power-law to smaller flux densities 
until its integral becomes unity: 



^ ^tail ^ <*-i 



(15) 



In our case with the values of Equation ([14]) we find 
x min = 3.57 =b 0.1 mjy. Correcting this value for xq = 
—2.94 ±0.1 mjy, this power-law distribution shows a cut- 
off at x^ in + xo = 0.63 ±0.15, which is identical to the av- 
erage flux density of the two background apertures close 
to Sgr A*. The measured distribution now can indeed be 
obtained by convolving the power-law distribution with 
a Gaussian distribution to account for the uncertainty of 
the photometry: 



PobsO) 



exp 



- z) 
2a* 2 



dz , (16) 



with a* the HWHM of the error distribution, which for 
our data sample can be considered constant up to ~ 15 
mjy (see Figure [7j). For higher values the histogram 
starts to be incomplete, and the bias due to the time 
correlation is dominating the statistical errors. The slope 
of the power-law is slightly changed by the convolution, 
but for values of a* of the magnitude of the observational 
errors this effect is within the errors of a. 

By using again KS-statistics we find a constant value 
of a* « 0.32 mjy as a best fit to the histogram (see 
Figure [T2]) . This value is larger than the observed er- 
ror a* « 0.2 mjy of better separated sources (see Fig- 
ure [7j). The reason is that the photometry on Sgr A* for 
a large fraction of the data is influenced by the nearby 
star SI 7, and that we subtracted a constant offset from 
epoch to epoch, for which it is difficult to find a real- 
istic error. Both influences effectively broaden the ex- 
pected distribution (which as a result is not Gaussian 
anymore in general, but somewhat less peaked). That 
this is not a disadvantage of the deconvolution method 
with respect to the doubl e-aperture method preferred by 
iDodds-Eden et al.l ([201 lh can be seen in Figure The 
error of the double aperture photometry for the low flux 
density range of Sgr A* (starting at flux densities > 3 
mjy) is also in the range of > 0.3 mjy. 

We conclude that it is not possible to verify the evi- 
dence of an intrinsic turnover (that could indicate the 
peak shape of a log-normal distribution) based on the 
larger scatter of the low states of Sgr A* with respect to 
the typical error of a better separated source of this flux 
density. The difference of 0.1 mjy between the typical 
error of a faint source of 0.2 mjy and the error value for 
our best fit of 0.32 mjy is well within the uncertainties 
of our knowledge about the true error distribution at the 
position of Sgr A* as Fig. [8] demonstrates. Having no ev- 
idence for a log-normal distribution for low flux density 




Fig. 11. — Schematic view of the power-law flux density distri- 
bution and the parameters x m in, ^min' x ° an( ^ ^he background 
flux density (y-axis in arbitrary units, x-axis in mjy). We show 
the measured flux density distribution (grey area) after adding 2.94 
mjy to account for xo, the pole of the measured non-shifted power- 
law (which belongs to the non-shifted distribution indicated by the 
long-dashed line), and the intrinsic distribution with and with- 
out correction for xo (described by the continuous lines ex x~ a 
and oc (x — xo)~ a , respectively). As in Eq. (|9|h x m j n is the 
minimum flux density down to which the shifted measured dis- 
tribution is a power-law, and x^ in is the minimum flux density 
obtained by an extrapolation of the power-law toward lower val- 
ues, assuming the distribution below x m in to be dominated by 
instrumental effects. Therefore, a^in represents the intrinsic min- 
imum flux density in the case of the xo-shifted distribution. For 
the case of the non-shifted distribution the intrinsic minimum 
is represented by x^ in + xo, which equals the background flux 
density within our uncertainties. Thus, the intrinsic minimum 
x min intr = X min + — backgr. (now additionally corrected for 
the background) equals zero. 



values 14 , the necessity of a break in the distribution to 
account for the highest flu x densities vanishes , even if we 
accept the data selection of lDodds-Eden et al.l (|2011f ) and 
ignore the fact that all flux density bins higher than 17 
mjy are populated due to one bright event only. Rather 
than a double state description we prefer a simple power- 
law with a slope ofa = 4.2±0.1 and an intrinsic pole at 
#o,intr. = ^o — backgr. = — x^ in = — 3.57±0.1 mjy. Since 
flux density is a positive quantity, this intrinsic power-law 
naturally breaks at x^ in intr = x^ in + £ ,intr. = mjy. 
The instrumental effects on the photometry are suffi- 
ciently described by a Gaussian distribution centered 
around the background value of 0.6 ± 0.1 mjy with a 
constant a = 0.32 mjy. This instrumental effect leads 
to a detection limit, which here is defined as the limit 
up to which a reliable photometry is not possible, of 
~ 0.7 ± 0.16 mjy intrinsically, and ~ 1.3 ± 0.15 mjy for 
the actual measurements which include the background 
flux density of 0.63 ± 0.15 mjy. With the power-law 
model we find a median value for the flux density of 
raed(x)intr. = 0-9 =•= 0-15 mjy (corrected for the back- 
ground), or me<i(x)obs. = 1.5 ± 0.1 mjy (including the 
background flux density). This shows that, assuming we 
can extrapolate the power-law to smaller flux densities 
below the detection limit, we find the average flux den- 
sity to be very close to the detection limit, indicating a 

14 because the shape of the intrinsic distribution close to the 
detection limit cannot be determined accurately enough to make a 
difference between a log-normal behavior and a truncated power- 
law 
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severe limitation of the knowledge about the variability 
of Sgr A* we are able to infer from our data. The rela- 
tion of xo, #min, ^min an d the background flux density is 
schematically shown in Fig. [TTJ 

Of course we cannot prove that the flux density dis- 
tribution is a strict power-law distribution. We only can 
show that the observable intrinsic flux densities can be 
well described by this model. This assumption is simpler 
and needs less parameters than than the assumption of a 
broken distribution. Nevertheless, it might well be that 
the real distribution shows some structure at flux den- 
sities below the detection limit. In particular it might 
even follow a log-normal distribution (with a high mul- 
tiplicative standard deviation to account for the linear 
appearance in the log-log plo t). The log- normal distri- 
bution used in the model of [Dodds-Eden et al.l (j2011l ) 
and an evidence for a break in the distribution at an 
observable flux density level, however, can be ruled out. 
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Fig. 12. — Flux density histogram like in Figure [5] The blue line 
shows the extrapolation of the best power-law fit, the cyan line the 
power-law convolved with a Gaussian distribution with a = 0.32 
mJy. 

For the sake of comparability with other objects that 
show log-normal distributions (e.g. iGaskelll I2QQ4T ) we 
want to include here the best fit parameters for a simple 
log-normal model. It is not easy to estimate the pa- 
rameters (and their uncertainties) of a log-normal distri- 
bution describing the intrinsic flux. The reason is that 
due to the described instrumental effects we do not have 
precise knowledge about the shape and position of the 
intrinsic peak. This is a more severe restriction in the 
case of a log-normal model that is characterized by its 
intrinsic turnover than for the power-law model. 15 Fit- 
ting the linear hist ogra m with a log-normal model as de- 
fined in Equation (|A4ft convolved with a Gaussian (with 
the width of the Gaussian as a free parameter) we ob- 
tain a best fit for a* = 1.00 ± 0.05, /i* = 0.12 ± 0.07, 
Xb = 3.38 ± 0.15, a width of the Gaussian distribution of 
a* = 3.38 ± 0.06 and a x 2 / dof = 1-6- The given uncer- 
tainties a larger then the formal uncertainties. In partic- 
ular, they allow for larger deviations from the histogram 
at low flux densities than implied by the statistical errors 
of the bin. This accounts for the fact that the true in- 
strumental effects are only approximately Gaussian (due 
to the non-uniform white noise contribution in each night 

15 The latter allows for a separation of the part of the histogram 
that is assumed to represent the intrinsic distribution of flux den- 
sities (> x m i n ) from the part dominated by instrumental effects. 



and the epoch- wise correction for stellar confusion). The 
intrinsic median flux density of the log-normal model is 
4.51 ± 0.2 mJy and equal to the the corresponding value 
for the power-law model of 4.47 ± 0.2 mJy (both for the 
shifted distribution). 

4. TIME SERIES ANALYSIS 

In this section we investigate the nature of the already 
mentioned time correlation of the flux density measure- 
ments. In a more formal description: Assuming that all 
measurements are samples of the very same random pro- 
cess, and starting with the idea that this random process 
is (weakly) stationary (for which the impression of a sta- 
ble mean and variance is indicative), we consider the flux 
density distribution of section [3] as the marginal proba- 
bility distribution of the random process. Now we want 
to find a characterization of the joint probability distri- 
bution. Whereas in general 'red noise' light curves 16 only 
can be considered as weakly non-stationary, the (weak) 
stationarity for the underlying rand om process is acon- 
sequence of the PSD break found bv lMever et "all ([2009) 
that is far shorter than the covered time period of 15 000 
minutes. 

A first, very simple approach for characterizing the 
time behavior of the variability is the following: Let 
us associate the average sampling (of the concatenated 
light curve) of ~ 1.2 minutes to every data point. In 
this way we can relate the total time the source spent 
in the range of a given flux density bin to the total time 
covered by observations (~ 15 000 minutes), and we get 
a rough estimator for the fraction of time the source 
spends at that flux density. For a more detailed analysis 
we have to use standard time series analysis tools, like 
periodograms as estimators for the power spectral den- 
sity (PSD) of the process, Lomb-Scarle periodograms, 
the autocorrelation function or the structure fun ction 
(IScardd[l 982: Pr iestlevl[T98l ISimonetti et al.lll985 ). As 
I Meyer et al.l (|2009f ) point out the given window function 
(covering ~ 3.6 • 10 6 min with a coverage fraction of only 
~ 0.4%) makes standard Fourier transform techniques 
unsuitable (see Figure [T3|). Similarly the Lomb-Scargle 
periodogram, generally suited as a PSD estimator for 
non-equally sampled data, is based on t he average sam - 
pling, which in this case is > 3000 min. IDo et all (2009) 
used the approach of comparing and averaging the Lomb- 
Scargle periodograms of data subsets with similar length 
and sampling to access the PSD of the higher frequencies. 
In our case this approach again would probably intro- 
duce selection effects, and we decided t o gener ally follow 
the method presented by iMever et al.l (|2009l ), a Monte 
Ca rlo (MC) approach , similar to the PSRESP m ethod 
bv lUttlev etalT (|2002f ). iDodds-Eden et al.l (|201lh men- 
tion t hat t he MC simulation approach used in IDo et all 
(2009) and IMever et al.l (|2009l ) are based on a comparison 
sample with a flux density distribution that is Gaussian, 
and in particular allows negative values, questioning the 
validity of the method. In the following we overcome 
these concerns by developing an algorithm that allows 
us to simulate time series with the flux density distribu- 
tion we observe. 

16 I.e., light curves with a time-correlation characterized by a 
power-law like power spectral density 
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Fig. 13. — Light curve of Sgr A* like in FigureE] In this case no time gaps have been removed, the data is shown in its true time coverage. 
A comparison of both plots shows: Only about 0.4% of the 7 years have been covered by observations. 



4.1. RMS -flux density relation 



2.5 - 




flux density [mJy] 

Fig. 14. — Relation between the rms (on timescales of 2 to 16 
min) and average flux density for data segments of 24 min (blues 
points). In red the re-binned data, in black the best linear fit. 

For the study of X-ray binary variability (linear) rms- 
flux relations represent an important piece of informa- 
tion. A relation between rms and the radiation flux 
was initially discovered in observa tions of black- hole an d 
neutron star X-ray binaries ([Uttlev fc McHardvl 12001). 
Since then the rms-flux relation has been studied in 
a several other observations of black- hole binaries, ac- 
tive galactic nuclei, neutron-star X-ray binari es, and an 
ultra -luminous X-ray sources (ULX) (see e.g. iHeil et aD 
1201 ll and references cited therein). The rms (root mean 
square) - as it is used here - is a measure for the mag- 
nitude of th e var iability of the light curve. Following 
lUttlev et al.l (|2005[ ) the absolute rms amplitude of vari- 
ability <r rms of a time series of n data points, a^, is defined 
as as: 



case exponential statistical, model. With this model the 
authors can convincingly reproduce the behavior of the 
observed X-ray light curves and rule out additive (shot- 
noise) models or self-organized criticality as the respon- 
sible processes. They conclude that the variability pro- 
cesses must be multiplicative. Because the rms-flux re- 
lation is stable for all spectral states of the black hole 
X-ray binary Cyg X-l, independent of its PSD shape, 
it is believed to be a more fundamental property of the 
variability than the PSD shape. 

We report here the discovery of the rms-flux relation 
for Sgr A* in t he NIR. Following t he description of the 
rms analysis in lUttlev et al.l ([20051 ) and, using time se- 
ries IDL-codes written by S.Vaughan, we estimated the 
flux density dependency of the rms on a frequency range 
from the average Nyquist-frequency of ~ 0.5 min -1 to 
0.06 min -1 (corresponding to timescales of 2 to 16 min- 
utes) for data sections with a length of 24 min. The 
algorithm works as follows: We divided the light curves 
in continuous data segments of 24 minutes, took the av- 
erage as a flux density estimate, and determined the PSD 
of this section. We then obtained the rms for timescales 
between 2 and 16 minutes by taking the square root of 
the integral of the PSD over the corresponding frequency 
range. 

The dominant ti mescale is of the or der of a few hundred 
minutes (150 min jMever et al.ll2009f ). Realizations of the 
random process that are not significantly longer than this 
timescale are weakly non-stationary. So for a given mean 
flux density the rms-values of our data sections of 24 
minutes are significantly scattered around their average 
value, and we re-binned the obtained rms-values into flux 
density-bins with a width AF = 1.2 mJy. The result is 
shown in Figure [I4j To a first approximation, we find a 
linear rms-flux dependence. 
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(17) 



In the case of weakly non-stationary segments of a sta- 
tionary light curve, <r rms varies randomly about a mean 
value. Under certain circumstances this mean scales with 
the average flu x density of the segm ent (x). 

In particular, lUttlev et al.l (|2005l ) related the rms-flux 
relation (which could be observed on all timescales for 
some of their sources) to a formally non-linear, in their 



4.2. Simulating light curves 

A simple argu ment for the p l ausi bility of a rms-flux re- 
lation is given by lUttlev et al.l (j2QQ5h . The authors point 
out that a product of two sinusoidal variations with two 
well separated frequencies, where the lower 'modulates' 
the amplitude of the higher frequency, would show a lin- 
ear rms-flux relation. In contrast to this, a linear and 
Gaussian random process would not show any correla- 
tion between flux density and rms. Because the rms-flux 
relation in their case holds for all observed timescales, 
the authors choose the ansatz: 
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x(t) = Y[[1 + Ai sin (2-KVit + &)] 



(18) 



For this multiplicative sine model the authors then can 
show that x t under general conditions is log-normal dis- 
tributed, that it can be obtained from a Gaussian linear 
random process l(t) by the transform x t ~ exp[Z(t)], and 
that for this kind of transform one indeed can derive the 
rms-flux relation to be a linear function. 

Whereas the rms-flux relation often is considered as 
indicative for a multipl icative process bei ng at work, it 
actually can be shown (|Uttlev et al.ll2QQ5l and references 
therein) that, for every non-Gaussian, skewed distribu- 
tion, the sample mean and variance are correlated (the 
distribution is heteroskedastic). Thus, another, less fa- 
vored possibility to explain the rms-flux relation and the 
skewness of the flux density distribution is simply a non- 
Gaussian linear process. It is because of this reason that 
one has to speak of a formally non-linear description, 
and, without the modeling in the framework of a con- 
crete physical model, it must remain unclear whether the 
non-linearity has a physical meaning rather than being 
the property of the mathematical description. 

In this case we do not know, if the observed rms-flux 
relation also is valid for a bigger range of time and flux 
density scales. As mentioned in section [3j the median 
flux density of the intrinsic distribution is close to the de- 
tections limit, i.e. we only detect the variability of Sgr A* 
about half of the time with reliable photometric accuracy. 
Furthermore, we have to use data segments of a length 
comparable to or shorter than the dominant timescale. 
Additionally, the segments have to be comparably short 
with respect to the typical data length (about 130 min- 
utes on average) to provide a sufficiently big number of 
rms-flux data pairs. On the other hand, the segments 
should be long enough to contain enough data points for 
a reliable rms estimation. In the case of our data, these 
constraints only allow an investigation of the timescales 
pres ented in section | 4. 11 

In lUttlev et al.ll2QQ5l the authors use Equation ([T8]) to 
show the plausibility of the multiplicative approach in 
the context of an rms-flux relation and a log-normal dis- 
tribution, deducing an exponential transform of a linear, 
Gaussian random process as a good approximation of 
their random process. Thus, in our case we ask which 
is the transform that has to be applied to a Gaussian 
distributed random variable to obtain a random variable 
that is power-law distributed with the parameters found 
in section 13.21 We then assume that we can apply this 
transform to a linear, Gaussian process to find a descrip- 
tion of the observed process. This actually is a standard 
method for generating Fourier transform based surro- 
gate data with a non-linear appearance (see lTheiler et al.1 
Il992h . 

T he st atistical flux density model we described in sec- 
tion 13.21 has a simple analytic form. This allows us to 
deduce an analytic transform 



xt = T(y t ) 



(19) 



with y t a Gaussian, linear process with unity variance, T 
the transform, and x t a power-law distributed process. 
In the following we describe how we find this transform. 



Let y be a random variable with Gaussian probability 
density p y of zero mean and unit variance: 



p y (y) 



exp 



2 



(20) 



and x a random variable with a power-law probability 
density p x : 



Px(x) 







x < c 
x > c 



(21) 



Let assume that a > 1. We are looking for a trans- 
formation x — T(y) such that p y transforms to p x . For 
such a transform the probability to find a value in the 
immediate surrounding of y has to equal the probabil- 
ity to find a value in the immediate surrounding of the 
corresponding x = T(y): 

p y (y)dy = p y [T-\x)] d ^ dx = p x (x)dx , (22) 



dx 



or 



PX {X) = P y [T-^X)] 



d[T-\x)] 



dx 



(23) 



To solve this equation we use a normalization argument: 
[ V p y (y')dy'= f X pAx')dx' . (24) 

J — oo J c 
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Equation 



x ° dx = 



{a -l) 
can be reduced to 
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(25) 



(26) 



with erf(y) the Gaussian error function. This can be 
solved for x: 



r 



T(y) . (27) 



To reproduce the variability with and intrinsic power- 
law distribution on top of a constant background flux as 
discussed in section 13.21 we have to replace c with x^ in 
and subtract xq: 



T(yt) 



l + erf 



m_ 

V2 



(1-cO 



Xq 



(28) 



with in defined in Equation ([15]) , xq as in Equa- 
tion (jl4|) . and a the slope of the power-law. 

With this transform, we are able to generate surrogate 
light curves (i.e. single realizations of the underlying 
process) for any input power spectral density (PSD) with 
the following algorithm: 

• Generate a Gau ssian, linear light curve following 
the method by iTimmer fc Koenid (|1995l ). This 
includes drawing Fourier coefficients for each fre- 
quency from a Gaussian distribution with a vari- 
ance proportional to the value of the PSD at that 
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frequency, and Fourier transforming to time do- 
main. 

• Normalize the obtained Gaussian process to a vari- 
ance of unity. Optionally re-sample the equally 
spaced data to the cadence of the observed data. 

• Transform the light curve according to Equa- 
tion (pU and Equation (|28|) . 

• Add an independently drawn quantity (e.g. Gaus- 
sian) for each time point to account for the white 
noise contribution of the measurement. 

As we show in the next section, the obtained surro- 
gate data represent the observed flux density distribu- 
tion (by construction) and the time behavior of the light 
curves of Sgr A*. Applied to a linear Gaussian light 
curve with the PSDs discussed in section [4731 we obtained 
light curves of an typical appearance as shown in Fig- 
ures [26] and [271 The non-linearity introduced by a trans- 
formation as described here is static. This means that 
for each time point we apply the same transform, and 
the non-linearity is only in the amplitude distribution 
of the observed qu antity and not in its dynamics (see 
iTheiler et al1[l992h . This can be illustrated with light 
curves generated with a double broken power- law PSD 
of the form: 




fj < fa 

fa < fj < fb 

fb < fj 



(29) 



< ot2 < as 



with f a < fb the break frequencies, and ot\ 
the power-law slopes. 

With the algorithm described above we generated 100 
equally sampled light curves with a sampling of 0.1 min 
and a length of 50 000 minutes. For each surrogate light 
curve we calculated the periodogram as an estimator of 
the PSD after transformation according to: 

Per(f j ) = |DFT(f j )| 2 



cos(2?r fjU) 

n 

sin(27r fjU) 



(30) 



with DFT the discrete Fourier transform. For a review of 
comm on conventions of normalization see lVaughan et aD 
(|2003[ ). Here we want to compare the shape of the 
PSD, and due to the normalization step applied to the 
Gaussian linear light curve only relative power is of im- 
portance. Because the Fourier based periodogram is 
not a consistent estimator of the PSD (i.e. for a sin- 
gle realization its standard variation is equal to the 
mean values at each frequency, irrespective of number 
of dat a points) some kind of averaging has to be ap- 
plied (jTimmer fc Koenid[l995l IVaughan et al.ll2QQ3h . so 
we averaged over the 100 surrogate sets we generated. 

A comparison of the input and the output PSD is 
shown in Figure [TU In the first approximation and with 



the exception of the high-frequency part (that is domi- 
nated by the white noise contribution) and a calibration 
factor the PSD is invariant under the transformation. 
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Fi g. 1 5. — The PSD under transformation according to Equa- 
tion (|28|) . In black a double broken power-law input PSD, in red 
the output PSD after applying the transform. The power is given 
in arbitrary units. 



4.3. The structure function and the PSD 

Now we can investigate the time correlation within the 
random process. One way to do this is an investigation of 
the structure function, a running variance method that 
measures the mean value of the flux density var i ance for a 
given time separation r (jSimonetti et al.lfl985l : IDo et al.l 
2009]): 



V(T) = ([x(t + r)-x(t)] 2 ) 



(31) 



The structure function of our data sample (on its true 
cadence) is shown in Figure [HO We only considered 
time separations with more than 300 flux density pairs. 
Clearly the night-day gap between ~ 360 min and ~ 1200 
min (A) and the section with low density of data points 
beginning at ~ 7500 min due to the typical length of the 
observation runs (B) are visible. The structure function 
shows the expected tendency for a flat behavior at small 
r- values (white noise of the measurement), a steeply in- 
creasing, power-law like mid dle section, and a flat be- 
havio r at longer timescales (jDo et al.l 120091 : IMever et al.l 
2009). Interestingly the structure function also shows 
a number of features starting at ~ 25 min, which give 
the impression of a second break at this timescale. This 
timescale is of particular interest in the discussion of the 
role of physical processes close to the innermost stable 
orbit. 

The analysis of the structure function is problem- 
atic. A comprehensive study about the use and the 
caveats of structure fun c tion m ethods can be found in 
lEmmanoulopoulos et al.l (|2010f ). The authors point out 
that spurious breaks may occur for many realizations of 
random processes with even featureless PSDs, only re- 
flecting the interplay of the PSD of the underlying ran- 
dom process and the data length. This can be easily un- 
derstood, because timescales much longer than the data 
length (weakly stationary case) can define the sample 
average (then different from the average of the process) 
around which the shorter timescales (smaller than the 
data length) might var y with similar, repeated fluctu- 
ations (see Figure 4 in lEmmanoulopoulos et all 12010). 
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Fig. 16. — The structure function of the observed data sample 
(time binning 1.2 min). 

This makes the true average of the underlying random 
process an essential piece of information, which in our 
case can be easily inferred. Although we do not know 
the true distribution below the detection limit, the aver- 
age of the flux densities above this limit (2.3 ± 0.1 mJy 
including the background) is an upper limit to the true 
average value. Because the true average has to be greater 
than zero, its uncertainty is small in comparison to the 
values reached by many outbursts. Thus, the main fea- 
ture of the structure function, the flattening towards long 
timescales, is indeed an intrinsic feature. T his also is sup- 
ported by the fact that lMever et al . (2009) find zero per- 
centage of acceptan ce for single slope power-law P SDs. 

Other concerns of lEmmanoulopoulos et aTl (|2010f ) are 
more important for our case: The structure function 
values for different timescales are not independent and 
not Gaussian, and for broken intrinsic PSFs the break 
timescale can occur at systematically lower values, mak- 
ing usual fitting algorithms and their error estimation 
unsuited. Furthermore, the authors show that one may 
expect plenty of artificial features in the case of non- 
equally spaced data. While the latter concern indeed 
makes it necessary to carefully investigate the discussed 
features of our structure function at shorter timescales 
(compare the structure function for dense and spars e 
sampling in Figure 12 of lEmmanoulopoulos et al. 2010), 
th e former c a n be o vercome by the procedure i ntroduced 
bv lDo et all (|2009D (also se dMever et HI 120091 ). 

The steps are as follows: 

• Starting with a double broken power-law PSD of 
the form of Equation ([29]) we generate 5000 light 
curves (as described above) for a number of com- 
binations of the parameters (ai, Q2, c*3, / aj /&)• 
Each light curve has a length of 4 • 10 6 min and a 
sampling of 1 min and is re-sampled to the cadence 
of the observed data. 

• We calculate the structure function for each surro- 
gate light curve in the same way as for the observed 
data sample. 

• We define a goodness parameter for the comparison 
of the individual structure function with the "aver- 
age" structure function for each set of parameters. 
The probability of acceptance of a parameter set 
is defined as the percentage of the 5000 surrogate 



light curves that have a worse goodness- value with 
respect to the "average" structure function than 
the observed sample. 



Me ver et~aT1 (|2009[ ) used standard x 2_vames an d an 
arithmetic average of the structure functions for the es- 
timation of the acceptance. To account for a possible 
non- Gaussian distribution of the structure function val- 
ues for a g iven separation r, we prefer a modified x 2 
estimation ([Emmanoulopoulos et al.ll201Q[ ): 
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(32) 



with ct/c the standard variation of log[V( r /c)]- We use 
the logarithm to make the skewed distribution of each 
structure function value more "symmetric". In partic- 
ular the difference of the mean and the most probable 
value, which is a consequence of the skewness, is reduced 
making the modified x 2 a measure of the distance to 
the most probable rather than to the average structure 
function, as it is necessary for a maximum likelihood ap- 
proach. 

Since f a corresponds to much shorter timescales than 
the overall length of the observed light curve (3.6 • 10 6 
min), and the power-law slop e of the PSD at small fre- 
quencies is very flat (~ -0.3, Meye7eFaD[2009), it is not 
necessary to produce much longer light curves to avoid 
red-noise leakage. Also a higher sampling rate and a sub- 
sequent smoothing in order to simulate the effect of the 
detector integration does not change the results of our 
simulations. 

We first explored the parameter space by manual fit- 
ting and then defined the range of parameters, for which 
we set up the Monte Carlo simulation. For the struc- 
ture function we used time separation bins of 1.2 min 
and concentrated on the first 287 points (up to a time 
separation of ~ 340 min) for the estimation of the accep- 
ta nce values, us i ng th e constant slope of a\ = 0.3 found 
by lMever et al.l (|2009f ) for the long timescales. The well 
fitting combinations are constrained by the fact that the 
normalization step makes the choice of e.g. as depen- 
dent on the choice of f . Additionally, the differences 
between f a and should correspond to a timescale that 
still can be measured within a typical observation length 
(~ 130 min). Finally, the difference between a 2 and as 
should still be big enough to differentiate the double bro- 
ken PSD model from a single broken model. For a double 
broken power-law PSD we tested all combinations of the 
following parameter set: 



ot\ =0.3 

a 2 = 1.8/1.9/2.0 

a 3 = 2.5/2.8/3.3 

f a = 0.001/0.0017/0.0025 

f h = 0.0133/0.0333/0.0533 



(33) 



with f a and given in min 1 , and to obtain a finer 
2jid, additionally all combinations of 

ai =0.3 




a 2 = 1.85/1.9/1.95 

a 3 = 2.65/2.8/3.05 (34) 
f a = 0.0014/0.0017/0.002 
f b = 0.0233/0.0333/0.0433 . 

For a single broken power-law PSD we tested the con- 
binations of: 

a\ =0.3 

a 2 = 2.0/2.3/2.6/2.9 

f a = 0.002/0.004/0.008/0.01 . (35) 

We find the highest probability of acceptance of 96% 
for a double broken power-law with slopes of a 2 = 1.9 
and as = 3.3 and break timescales at = 20 min and 
f a = 590 min, respectively. Several combinations in the 
ranges of a 2 = 1.8 — 2.0, as = 2.5 — 3.3 for the slopes, and 
20-40 min and 500-700 min for the break timescales reach 
acceptance values higher than 90%. Considering the 
typical statistical fluctuations of the acceptance values 
of about 2 percentage points they can be considered as 
equivalent. In comparison the single broken PSD models 
with a break in the range of the 90% confidence interval 
of lMever et "all (|2009f ) (154t<^ 4 min) only reach 75% at 
maximum. On the other hand, the high acceptance val- 
ues are typically reached for values of /& = 0.0017/0.002 
and a 2 = 1.8 — 2.0, independent of the values for f a and 
as. Indeed, a single broken power-law PSD with a 2 = 2.0 
and f a = 0.002 (500 min) also reaches an acceptance level 
of 92%, making the question if the true PSD has more 
structure than a single break, undecidable on the base of 
the presented dataset. The behavior of all here discussed 
parameter sets for timescales longer than 500 min (not 
depicted here) match the measured structure function. 

In Figure [17] we show the most probable structure 



function estimated from 5000 surrogate datasets for a 
high probability double broken PSD model with break 
timescales at 30 min and 590 min (94% of acceptance) 
and the highly probable single broken PSD model (92% 
of acceptance, break at 500 min). Both are almost iden- 
tical. Additionally, we present the most probable struc- 
ture function for a single broken PSD model with a lower 
break timescale at 210 min and a probability of accep- 
tance of 68%. We can clearly see that all three structure 
functions show a flattening towards longer timescales be- 
ginning at about 25 min, independent of their dominant 
timescale or the shape of the PSD. This shows that the 
features in the observed structure function starting at 
~ 25 min are dominated by the influence of the win- 
dow function of the dataset and cannot be interpreted as 
intrinsic. 

Light curves generated with the double broken PSD 
with 92% acceptance and with the single broken PSD 
with 68% acceptance are shown in Figure [26] and Fig- 
ure [2Tj Figure [28] shows the comparison of long light 
curves for the high probability single and double broken 
cases, demonstrating their similarity on long timescales. 
Also on shorter timescales the high probability single bro- 
ken models generate light curves which are not obviously 
different from the high probability double broken cases. 
It is plausible that a difference that is difficult to find 
in time continuous data can not be significant in data 
with a sparse time coverage. Note that due to the domi- 
nant timescale in both light curves of about 500-600 min 
there are time intervals (sometimes even more than a 
day) where the brighter flares start from higher flux den- 
sity levels than the normal minimum level. In the case 
of a sparse data coverage this can lead to a misinterpre- 
tation of these minimum level differences as variability 
on long timescales, and from this point of view the inter- 
pretation of these differences as variations on timescales 
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of weeks and month given in iDodds-Eden et all (|2011[ ) is 
not the only possible explanation. 

We have to make two comments on error bars and con- 
fidence levels: First, there is no good method to provide 
error bars for the observed structure function, because 
it would require knowledge about the true PSD and its 
interaction with the window function of the observation. 
Additionally, the distribution of the single point in the 
structure function can be very skewed and it is ques- 
tionable whether e.g. a standard deviation over the MC 
light curves can be established as a good error estimate. 
Secondly, the break timescale of 500 min found is statis- 
tically equal to the value found by iMever et al.l ([2009) 
( which lies in our 90% confidence interval). However, 
confidence levels deduced from acceptance values take 
into account the probability that the observed sample is 
not representative for the true variability but a statis- 
tical "outlier", a possibility that, considering the cover- 
age fraction of 0.4%, makes any conclusion unreliable. 
For example, on a 90% confidence level, the light curves 
generated from PSDs with acceptance values of 68% and 
94%, respectively, (Fig. [28]) are indistinguishable. On the 
other hand, under the assumption that we actually are 
looking at a typical sample that represents the variabil- 
ity of a continuous 15000 min data piece quite well, we 
find an argument against the 68%-PSD: The generated 
light curve seems to show too many flares on the 20- 
30 mJy level. This demonstrates that the insignificance 
of a break at lower timescales due to identical accep- 
tance values is more fundamental than the criticism that 
we might look at an exotic realization of the underlying 
process, or that the analysis suffers from an accidental 
selection effect: With the cadence of our observations, 
especially with the day- night gap, we cannot decide on 
question whether a PSD-break at timescales between 25 
and 100 minutes is characteristic for the variability, even 
if we assume that our data sample is representative. 

As a last step we can use the 5000 re-sampled light 
curves with the best fitting structure function to test the 
plausib ility of the power-law assumption as described in 
section 13.21 (now taking into account the time correla- 
tion). We find a goo dness parameter of q — 0.79 (as 
defined in section 13.2 j) . much higher than the value of 
q = 0.2 for independent data, firmly establishing the 
plausibility for the power-law description of the proba- 
bility density. The observed CDF and the CDFs of 20 
randomly selected surrogate light curves are shown in 
Figure [THJ The values of the parameters in Equation ([T4]) 
can be confirmed also for the case of correlated data, the 
uncertainties are slightly bigger (0.15 mJy for xo and 
x m i n , and 0.3 for a). 

5. EXTREME FLUX DENSITY EXCURSIONS 

In this section we investigate the consequences of an ex- 
trapolation of the measured time correlated power-law to 
high flux density levels and long timescales. It is obvious 
that infinitely bright outbursts are unphysical, raising 
the question of a physical constraint for the maximum 
flux densities that can be expected. 

5.1. Maximum expected NIR flux density 

Recent measurements of the VLBI source Sgr A* at 
230 GHz resulted in a lower limit to the brightness tem- 
perature of T b = 2 x 10 10 K (jDoeleman et all 12008). For 




flux density [mJy] 

Fig. 18. — The observed CDF (black) of flux densities and 20 ran- 
domly selected CDFs of time correlated power-law surrogate data 
(colored dashed lines). The best fit CDF is shown as a continuous 
red line. 

the case of a synchrotron source the relations between 
observed and intrinsic flux densities and frequencies 

Sobs = 5 3 ~ a Si n t (36) 
v ohs = 8v int (37) 

^(l-Z^^i-Il-^-cos^)]- 1 (38) 

P = ~ , (39) 
c 

with v the bulk motion of the synchrotron region, <fi 
the viewing angle toward the emitting component, a 
the spectral slope, and Ui nt the self-absorption peak fre- 
quency in GHz, result in a brightness temperature (in- 
cluding the possibility of beaming): 



1.22 S b s 1.22 Sii 



(40) 



9 is the source diameter in milliarcseconds and X5 is the 
equiparti tion brightnes s temperature in units of 10 12 K 
(see lEckart et al.ll2012f ). Thus, for constant 6 and 5 the 
brightness temperature is proportional to the flux den- 
sity. From 345 GHz and 690 GHz SMA measurements, 
there is evidence that the self-absorption peak frequency 
of most synchrot ron source components in Sgr A* pe ak 
around 345 GHz dMarrone 2006; Marrone et aDl2QQ6al lbL 
see also lEckart et al.l 1201 2\ ). With the lower limit for 
the brightness temperature mentioned before, which has 
been obtained close to this peak frequency, we can cal- 
culate a flux density limit which extreme outbursts in 
the NIR must reach, assuming that the size of the lumi- 
nous region stays approximately constant and that the 
brightness temperature is linked to optically thin infrared 
emission. The maximum brightness of the synchrotron 
source Sgr A* is certainly given by the inverse Comp- 
ton limit of 10 12 K (or a few times 10 12 K in the case of 
boosting). With both constraints the maximum bright- 
ness of Sgr A* can be expected to be about 100 times 
brighter than regular flare amplitudes. Since these am- 
plitudes reach K-band flux densities of around 20 to 30 
mJy, extreme values about 3 Jy can be expected. Only 
for smaller source sizes and lower self-absorption frequen- 
cies, or higher bulk motions during the flare the expected 
extreme K-band flux density may be lower. 

5.2. Extreme X-ray flares in the past 
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The present X-ray luminosity of Sgr A* is more than 
10 orders of magnitude less than its Eddington lumi- 
nosity. The observation of hard X-ray emission and 
iron fluorescence from some of the massive molecular 
clouds surrounding the Galactic Center has been inter- 
preted as a light echo of a luminous pa st flare, that may 
have happened up to 400 years ago (Revnivtsev et al.l 
2001 [SumTaev fc Churazovl fl998l iTerrier et al.l l2oToh 7 
Terrier et al.l (j2010h report the observation of a clear de- 
cay of the hard X-ray emission from the molecular cloud 
Sgr B2 during the past seven years. They argue that this 
decay strengthens the case for such a bright flare in the 
past and significantly weakens the alternative explana- 
tions involving low-energy cosmic rays. 

The authors also argue that the luminosity of the event 
which may have been responsible for the observed light 
echo was 1.5-5xl0 39 erg s -1 . Capelli et al. (2012) 
present evidence for the fact that the luminosity of Sgr A* 
decreased during the past 150 years (in good agreement 
with Muno et al. 2007) with upwards and downwards 
trends lasting from a few to perhaps 10 years. 

If we assume that the infrared flares are linked to X- 
ray flares by the underlying radiation mechanism, it is 
interesting to compare the flux densities of the brightest 
X-ray events in the past with our infrared flare statistics 
that we obtained over the recent ~7 year period, even 
if the timescales of those events may have been signifi- 
cantly different. In this picture the short and long term 
variability would be due to the same underlying physi- 
cal process and the present time is close to an 'off-state' 
while the flare events lasting for a few to 10 years during 
the past 150-400 years corresponds to 'on-states' in the 
Sgr A* light curve. 

Lets us assume that such a bright flare in the X-ray 
domain occurred between 150 and 400 years ago. This 
means that its occurrence is between a factor 5 x 10 -2 and 
7 x 10 -5 less frequent than the brightest infrared flares 
observed until today (one 30 mJy event every 15000 min- 
utes - as observed - to one 30 mJy event every 7 years 
in the case that the observed flare was the brightest over 
the total period of seven years). The bulk of the X- 
ray flares obse rved in recent y ears lies between 10 34 to 
3x 10 35 erg s/s (jBaganoff et alJ2QQlLl2Q0llPorquet et al.l 
I2003L l2008[ ). Assuming an optically thin spectrum be- 
tween the NIR and X-ray domain, and assuming that 
the brightest K-band flares (of 10 to 30 mJy) sample the 
brightest X-ray flares, we can expect that the K-band 
flux density associated with the flare 150-400 years ago 
has been 0.05 x 10 5 to 5 x 10 5 times higher (correspond- 
ing to 50-15000 Jy) than what we have observed until 
today. This is 2 to 4 orders of magnitude above the K- 
band flux density limit we estimated for the case of the 
inverse Compton limit. Hence an optically thin NIR/X- 
ray flare can be excluded as the source of the light echo. 
However, the requirements on the K-band flare bright- 
ness are considerably reduced if we assume a synchrotron 
self- Compton (SSC) mechanism as a source of the bright 
X-ray flare. We assume a synchrotron radio spectrum 
with a turnover between the optically thick and the thin 
part at a frequency of v m (in GHz) and a flux density 
S m (in Jy) with an optically thin spectral index a 

S u ocu~ a . (41) 



Adapting the formulae given by [Marscher (fl983l [2009) 
for the case of the Galactic Center, we then find that the 
SSC X-ray flux density Sx,ssc ( m A^Jy)? is given by 

S x ,ssc=d(a)ln (^-) 9 -W )jy -(s a+5 ) 

xS^ a+ ^E- a S- 2 ^ , (42) 

where d(a) is a dimensionless parameter and Ex the X- 
ray photon energy in keV. Relativistic boosting, denoted 
by (5, may occur for anisotropic motion along a jet, or due 
to orbital motion close to the black hole, or due to a rapid 
isotropic expansion of a source component (see discus- 
sion bv lEckart etllll2012f ). With the S x ,ssc oc #m (a+2) 
dependency the 10 4 to 10 5 increase in X-ray luminosity 
can be achieved by an increase of the K-band flux den- 
sity by a factor of 5 to 12. H ere we used a synchro tron 
spec tral index a of 0.7 ±0.3 (|Hornste in et al.l 120071 see 
also iBremer et al.l |2011[ ) and assumed that source sizes 
and turnover frequencies are similar to what had been 
obtained for the recently observed flares. This indicates 
that the light echo could be produced by a 50 to 360 mJy 
flare that is a factor of 5 x 10 -2 to 7 x 10 -5 times less 
frequent than the recently observed brightest Ks-band 
flares. These are flux densities that are well below those 
derived for the inverse Compton limit. Assuming that 
the frequency of occurrence (measured in min -1 ) of peak 
flux densities of rare events is proportional to the fre- 
quency of flux densities in the flux density histogram 
(measured in mJy -1 ), and with a frequency of about 
10 -4 mJy -1 for flux densities around 30 mJy we obtain 
a range of 5 x 10 -6 - 7 x 10 -9 mJy -1 and 50-360 mJy in 
the frequency-flux density diagram. Fig. [19] shows that 
(within the uncertainties) these values are in agreement 
with the extrapolated probability density of the K-band 
flux densities as obtained over the past 7 years. Also 
the long light curves in Fig. [28] show that flares with 
amplitudes up to 100 mJy can be expected already for 
continuous light curves of 2 • 10 6 min, and for 5000 light 
curves with the observational cadence we find typically 
5-7 datasets with maximum flux densities over 250 mJy. 

While the presence of bright NIR flares may be in- 
ferred from the SgrA* flux distribution histogram, the 
occurrence of these flares and the flare length must 
be discussed in the framework of the PSD derived for 
SgrA*: Gonzales-Martin and S. Vaughan (2012) have in- 
vestigated the PSDs of 104 nearby AGNs of which 72% 
show strong variability in at least one XMM band and a 
high percentage of low-luminosity sources does not show 
a significant variability over the monitoring time span 
within the limited XMM lifetime. Only for 17 sources 
(15%) PSD breaks could be found which appear to fol- 
low a linear scaling relations of break timescales with 
the black hole mass. For 85% of the total number of 
sources no break time scale at all was found and es- 
pecially for the sources with significant variability the 
vast majority (77%) have not reported break timescale 
(Gonzales-Martin and S. Vaughan (2012) although their 
black hole masses certainly will compare well with the 
mass range covered by those few for which a break fre- 
quency was detected. This implies that the majority of 
sources shows a significant power at variability frequen- 
cies lower than the break frequencies expected for their 
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Fig. 19. — Illustration of the likelihood of extreme flux density excursions extrapolated from the statistics of the observed variability. The 
expected maximum flux density given by the inverse Compton catastrophe and an estimate of its uncertainty are shown as the blue circle, 
the SSC infrared flux density for a bright X-ray outburst as expected from the observed X-ray echo is depicted as the red rectangular. The 
lower limit for the NIR flux density in the case of a pure synchrotron model is indicated with the black arrow. 



specific black hole masses (if 17 sources are regarded as 
representative). From our present knowledge of variabil- 
ity of galactic nuclei it is unclear if a source for which 
a PSD break frequency has been detected over a typi- 
cal observing timescale of only lelO years, does always 
have to show such a break frequency. Given the current 
statistics (Gonzales-Martin & Vaughan (2012) we may 
in general assume that, if sampled at other times, these 
sources are more likely to show variability characteristics 
closer to that of the majority of the sources. In that case 
150 to 400 years ago Sgr A* may very well have shown 
significant power at variability frequencies lower than the 
break frequencies. The required flare lengths of a few to 
about 10 years (Capelli et al. 2012) are short compared 
to this time span. 

In addi t ion, e xtraordinary events may occur: 
iFrei tag (2003) found that for Milky Way type galaxies 
the probability of an interaction between a super mas- 
sive black hole and a main-sequence star (MSS), a white 
dwarf (WD) and a neutron-star or stellar black hole 
(NS/SBH) is of the order of 10" 5 , 10" 7 and 10" 8 events 
per year, respectively. The orbital decay and a possi- 
bly significant mass transfer may last for several months 
or even years before the actual merger (e.g. Capozziello 
et al. 2009), such that as a consequence significantly 
enhanced accretion rates and flares are likely to be ob- 
served. It is also speculated that these merger events 
actually contribute significantly to the mass content of 
SMBH accretion disks. The flux densities associated 
with these merger events will be appreciable (two cases 
for which this phenomenon has rece ntly been discussed 
are Swift J2058.4+ 0516 at z=1.2, iCenko et al.l [20TTI : 
iKrolik fe Pimnl[20TTI : and Swift J1644+57 at z=0.3, Met- 
zger et al. 2012). It can also not be excluded that an 
increase in the accretion rate and therefore an increase in 
luminosity can be caused by encounters with members of 
the central stellar cluster as proposed by iGillessen et al.l 



(j2012[ ). It needs to be investigated how fast a stationary 
process can recover from such events. 

6. CONCLUSION 
We summarize our results: 

• The Ks-band flux density distribution of Sgr A* as 
obtained from the last seven years of observations is 
in convincing agreement with a pure power-law dis- 
tribution, giving no indication for a break or two- 
state behavior. 

• We could find an upper limit of the intrinsic mean 
flux density of about 1.7 ± 0.15 mJy, and with a 
power-law extrapolation to flux densities below the 
detection limit a mean of 0.9 ±0.15 mJy (including 
extinction correction). 

• We found an algorithm to statistically simulate 
light curves that show the same flux density distri- 
bution and time correlation as t he observed sample. 
It is b ased on the algorithm by iTimmer fc Ko enig 
(1995) to generate linearly time-correlated surro- 
gate samples, but includes a transformation to ac- 
count for the non-linear appearance of the NIR flux 
densities of Sgr A*. In a first approximation broken 
power-law PSDs are invariant under this transfor- 
mation. This statistical model (best fitting PSD, 
flux density distribution and the algorithm) does 
not provide immediate information on the physi- 
cal system, but serves as a statistical "summary" 
of the observed variability, defining constrains for 
every physical model. Furthermore, it allows a 
straight forward power-law extrapolation to higher 
flux density levels, flux density levels below the de- 
tection limit, and long timescales. 

• This extrapolation demonstrates that high flux 
density excursions, as required to explain the sup- 
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posed X-ray light echo in the molecular clouds sur- 
rounding Sgr A*, are well within the expected sta- 
tistical extreme values of the flux density distribu- 
tion that we observed at much lower flux densities. 
This implies that even if we include the flare re- 
sponsible for the X-ray echo as the most extreme 
event that is suggested by observations there is no 
evidence for a two-state variability behavior. 

• Within our statistical model the concept of a "qui- 
escent state" and the differentiation between con- 
tinu ous variability and "off" states (as investigated 
by iDodds-Eden et all l201lf ) turn out to be prob- 
lematic: In this description, Sgr A* is always vari- 
able and the probability to find it at a flux density 
level of zero is actually zero. But any flux den- 
sity interval starting with zero is represented with 
a higher probability than any other interval of the 
same length, allowing for arbitrarily faint flux den- 
sity states. So from an observational point of view, 
this model predicts "off" states due to the instru- 
ment dependent limited resolution and sensitivity. 
In the case of Ks-band observations with NACO the 
detection limit is very close to the mean flux den- 
sity of the intrinsic distribution. This means that 
the distribution of about half of the intrinsic flux 
density states can not be accessed with our current 
resolutions and sensitivity. Next generation instru- 
ments like LINC-NIRVANA at the Large Binocular 
Telescope and GRAVITY at the VLTI will allow to 
probe the lower flux density range. 

• The question whether timescales comparable to the 
orbital timescale at the innermost stable orbit play 
a role for the variability is principally undecidable 
on the base of this data. In order to access this 
problem a significant amount of continuous light 
curves with a length of more than 1500 min would 
be needed. This, however, does not exclude the 



presence of or bital signatures in polarim etric data 
as reported in IZamaninasab et all (j2010f ) . Here we 
are investigating all light curves in total under the 
assumption of stationarity. Thus, we cannot make 
statements on a possible time development of the 
dominant timescale or the role of shorter timescales 
as a transient phenomenon. 
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APPENDIX 
TWO STATE PROBABILITY MODEL 

As a description for the observed distribution of the flux density x (measured in mJy) iDodds-Eden et all ([20111 ) 



suggest the following probability density: 



0+errV 



(x) = / Po(x') 



27ra ohs (x) 



exp 



-(x-x'f 



_ 2a ohs (x) 2 _ 



dx' 



i.e., a convolution of an Gaussian error with flux density-dependent width 

0.5 



^obs 

mJy 



0.174 



x 



mJy 



and an intrinsic two state probability density: 

( kP\ ogn (x) : x<x b + x t 

P ° (X) = ( kP lom {x t + x h )(^y S : x>x t + x b 

with k a dimensionless normalizing factor, and the log- normal part defined as 



2,71(7 * (x — Xb) 



exp 



In 



I X-Xb \ 

\ mJy J 



2<T? 



(Al) 



(A2) 



(A3) 



(A4) 



The parameters of their best fit model (x 2 /dof = 2.99) are <7* = 0.75 ± 0.05, /i* = 0.05 ± 0.06, s = 2.7 ± 0.14, 
Xt = 4.6 ± 0.5 mJy, and Xb = 3.59 ± 0.06 mJy (the unit of the probability density is mJy -1 ). 
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DATA QUALITY 

Here we give an overview about the observation conditions. All plots show the data after quality cut. 
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Fig. 20. — Time series of quality parameters. We show the airmass during observation, the seeing values obtained by a measurement on 
the active optics guide star, the atmosphere co here nce to (not available for all frames), and the Strehl ratio obtained from the extracted 
PSF. For a histogram representation see Figure [21] 
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Fig. 21. — Histograms of the guide star seeing, the Strehl ratio, and the atmospheric coherence time. 
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Fig. 22. — The left panel sho ws the integration times used for our data sample, the right a histogram of the average calibrator flux densities 
(see description in section [2?2]) . 
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LIGHT CURVES 

Here we show a more detailed presentation of the data of Figure [2] (blue points). Each box represents a continuous 
data piece without interruption longer than 30 min. Additionally to the data of Sgr A* we show the light curves of the 
calibrator S7 (green), of the average calibrator flux density (red), and the average background apertures (b-apertures 
in Figure [TJ grey). Please note that the x-axis is scaled differently, and in the case of the very bright flare of August 
5th 2008 the y-axis as well. The missing points of S7 are caused by the rejection algorithm described in section 12.21 
Since it is very difficult to estimate a reliable error for the individual point due to the changing correction conditions 
of the AO system and its interplay with the extended background, the confusion and the deconvolution algorithm, 
we did not include error bars. The point to point scatter of the comparison star and the calibration can serve as an 
estimate for the individual dataset. The overall error statistics are described in section 13. 2\ and in average we find 
a Gaussian error of about a = 0.3 mJy. Furthermore, we present a Table (Tab. [Cj of all datasets included in this 
analysis with all important information, including average sampling, length of dataset, and maximum flux density. 
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, — Light curves of Sgr A* (blue), S7 (green), average calibrator flux density (red), and background. 
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Note. — This table shows basic information for each data set shown in the Figures before: start-stop times, length of the data set, number of frames, maximum flux density occurring in the 
set, average time sampling over the set, the used integration time, the observational mode (imaging or polarimetric), the camera (S13 with 13" FOV or S27 with 27"), and the project ID of the 
observations. Note that start-stop times, frame numbers and length average are given for the datasets before applying the quality cut, average sampling and maximum flux density after the cut. 
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FLUX DENSITY STATISTICS 

In this Appendix we present supplement information for the statistical analysis described in section 13.11 We present 
histograms of the Ks-Band flux densities of Sgr A* with different than the optimal binning, showing that the linear 
behavior of the histogram is not binning dependent, and the CDF s of independently drawn power-law distributed 
surrogate data in comparison to the observed CDF (see section lcT2]) . 
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Fi g. 24. — The observed flu x densities in histograms with bin numbers higher (upper left, 45 bins) and lower (upper right, 22, com- 
pare Dodds-Eden et al. 2011) than optimal (32). The latter we also show in a integration time weighted version (as conducted by 
Dodds-Eden et al. 2011), finding now significant difference (lower plot). 
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Fig. 2 5. — Comparison of the observed CDF with surrogate data drawn independently from a power-law according to Equation (|14|) 
(section |3.2|) . Although the most obvious differences seem to be at highest flux densities, this impression is caused by the logar ithm ic 
presentation (left). The KS statistics used for determining the best x m i n and the plausibility of a power-law model in section 13.21 is 
dominated by differences at lower flux densities as demonstrated in the plots on the right, showing the comparison of the observed CDF 
and a surrogate CDF with a worse Kolmogorov- value than the observed data (upper right: CDFs in linear plotting, lower right: difference 
of the CDFs). 
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GENERATING FOURIER TRANSFORM BASED SURROGATE DATA 



Examples of red- noise light curves obtained by transformation according to Equation ([28]) . The transform was 
applied to a linear Gaussian light curve with single and double broken power-law PSDs as discussed in section l~T~3l 
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Fig. 26. — Simulated lightcurves taken from a 4 • 10 6 min time series created with the best double-broken power-law PSD. The upper 
panel shows typical 15 000 min, the lower panels 500 min closeups with lower and higher flux density l evel s. Light curves created with the 
best single broken PSD show to a first order the same behavior as the presented case (compare Figure [28)) . 
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Fig. 27. — Simulated lightcurves taken from a 4 • 10 6 min ti me s eries created with a single-broken power-law PSD with an acceptance 
of 68%, corresponding to the blue structure function in Figure [TT1 The upper panel shows typical 15 000 min, the lower panels 500 min 
closeups with lower and higher flux density levels. 
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Fig. 28. — Long term appearance of light curves generated with the best single (blue) and the best double (magenta) broken PSD. On a 
time period of 2 • 10 6 min no obvious difference is noticeable. 



